67 research outputs found

    PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical Domain

    Full text link
    Biomedical language understanding benchmarks are the driving forces for artificial intelligence applications with large language model (LLM) back-ends. However, most current benchmarks: (a) are limited to English which makes it challenging to replicate many of the successes in English for other languages, or (b) focus on knowledge probing of LLMs and neglect to evaluate how LLMs apply these knowledge to perform on a wide range of bio-medical tasks, or (c) have become a publicly available corpus and are leaked to LLMs during pre-training. To facilitate the research in medical LLMs, we re-build the Chinese Biomedical Language Understanding Evaluation (CBLUE) benchmark into a large scale prompt-tuning benchmark, PromptCBLUE. Our benchmark is a suitable test-bed and an online platform for evaluating Chinese LLMs' multi-task capabilities on a wide range bio-medical tasks including medical entity recognition, medical text classification, medical natural language inference, medical dialogue understanding and medical content/dialogue generation. To establish evaluation on these tasks, we have experimented and report the results with the current 9 Chinese LLMs fine-tuned with differtent fine-tuning techniques

    Contrastive Triple Extraction with Generative Transformer

    Full text link
    Triple extraction is an essential task in information extraction for natural language processing and knowledge graph construction. In this paper, we revisit the end-to-end triple extraction task for sequence generation. Since generative triple extraction may struggle to capture long-term dependencies and generate unfaithful triples, we introduce a novel model, contrastive triple extraction with a generative transformer. Specifically, we introduce a single shared transformer module for encoder-decoder-based generation. To generate faithful results, we propose a novel triplet contrastive training object. Moreover, we introduce two mechanisms to further improve model performance (i.e., batch-wise dynamic attention-masking and triple-wise calibration). Experimental results on three datasets (i.e., NYT, WebNLG, and MIE) show that our approach achieves better performance than that of baselines.Comment: Accepted by AAAI 202

    From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer

    Full text link
    Knowledge graph completion aims to address the problem of extending a KG with missing triples. In this paper, we provide an approach GenKGC, which converts knowledge graph completion to sequence-to-sequence generation task with the pre-trained language model. We further introduce relation-guided demonstration and entity-aware hierarchical decoding for better representation learning and fast inference. Experimental results on three datasets show that our approach can obtain better or comparable performance than baselines and achieve faster inference speed compared with previous methods with pre-trained language models. We also release a new large-scale Chinese knowledge graph dataset AliopenKG500 for research purpose. Code and datasets are available in https://github.com/zjunlp/PromptKG/tree/main/GenKGC.Comment: Accepted by WWW 2022 Poste

    Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction

    Full text link
    Recent neural-based relation extraction approaches, though achieving promising improvement on benchmark datasets, have reported their vulnerability towards adversarial attacks. Thus far, efforts mostly focused on generating adversarial samples or defending adversarial attacks, but little is known about the difference between normal and adversarial samples. In this work, we take the first step to leverage the salience-based method to analyze those adversarial samples. We observe that salience tokens have a direct correlation with adversarial perturbations. We further find the adversarial perturbations are either those tokens not existing in the training set or superficial cues associated with relation labels. To some extent, our approach unveils the characters against adversarial samples. We release an open-source testbed, "DiagnoseAdv" in https://github.com/zjunlp/DiagnoseAdv.Comment: IJCKG 202

    LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

    Full text link
    Natural language generation from structured data mainly focuses on surface-level descriptions, suffering from uncontrollable content selection and low fidelity. Previous works leverage logical forms to facilitate logical knowledge-conditioned text generation. Though achieving remarkable progress, they are data-hungry, which makes the adoption for real-world applications challenging with limited data. To this end, this paper proposes a unified framework for logical knowledge-conditioned text generation in the few-shot setting. With only a few seeds logical forms (e.g., 20/100 shot), our approach leverages self-training and samples pseudo logical forms based on content and structure consistency. Experimental results demonstrate that our approach can obtain better few-shot performance than baselines.Comment: Work in progres

    Construction and Applications of Billion-Scale Pre-trained Multimodal Business Knowledge Graph

    Full text link
    Business Knowledge Graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building business KG necessitates solving prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges related to building KG in non-trivial real-world systems. We introduce the process of building an open business knowledge graph (OpenBG) derived from a well-known enterprise, Alibaba Group. Specifically, we define a core ontology to cover various abstract products and consumption demands, with fine-grained taxonomy and multimodal facts in deployed applications. OpenBG is an open business KG of unprecedented scale: 2.6 billion triples with more than 88 million entities covering over 1 million core classes/concepts and 2,681 types of relations. We release all the open resources (OpenBG benchmarks) derived from it for the community and report experimental results of KG-centric tasks. We also run up an online competition based on OpenBG benchmarks, and has attracted thousands of teams. We further pre-train OpenBG and apply it to many KG- enhanced downstream tasks in business scenarios, demonstrating the effectiveness of billion-scale multimodal knowledge for e-commerce. All the resources with codes have been released at \url{https://github.com/OpenBGBenchmark/OpenBG}.Comment: OpenBG. Work in Progres

    Laboratory and Experimental Hut Evaluation of a Long-lasting Insecticide Treated blanket for Protection against Mosquitoes.

    Get PDF
    Long-lasting insecticide treated blankets (LLIBs) may provide additional protection against malaria where use of long lasting insecticidal nets (LLIN) is low or impractical such as in disaster or emergency situations. Initial efficacy testing of a new candidate LLIB was carried out at LSHTM and KCMUCo, before and after washing, in cone and ball bioassays and arm-in-cage tests against pyrethroid susceptible Anopheles gambiae. A small scale field trial was conducted using veranda-trap experimental huts in northern Tanzania against wild An. arabiensis and Culex quinquefasciatus mosquitoes. Treatments included unwashed and 5 times washed permethrin treated LLIB and blankets hand-treated with permethrin (ITB), untreated blankets, and a holed unwashed Olyset net. Cone test mortality was 75% for LLIB when unwashed, but decreased to 32% after 5 washes and <10% after 10 washes. In arm-in-cage tests protection against biting was 100% for LLIBs regardless of the number of washes while reduction in landings was 79% when unwashed, 75% after 5 washes, but declined to 41% after 10 and 33% after 20 washes. In ball bioassays using pyrethroid resistant An. arabiensis, mortality was low in all treatments (<35%) and there was no significant difference in mortality between Olyset net, LLIB or ITB (p > 0.05). Percentage mortality of An. arabiensis in huts with LLIB unwashed (26%) was not statistically different to Olyset net (31%, p = 0.5). The 5 times washed LLIB reduced blood-feeding by 49% which was equivalent to Olyset net (p > 0.086). There was no significant difference in percentage blood-feeding between LLIB and ITB unwashed or 5 times washed (p = 0.147 and p = 0.346 respectively). The 5 times washed LLIB reduced blood-feeding of Culex quinquefasciatus by 40%, although the Olyset provided the greatest protection with 85% inhibition. ELISA analysis of a sub-sample of blood fed mosquitoes showed that not all had fed on humans in the huts, therefore blood-feeding inhibition may have been underestimated. This trial demonstrated the potential of LLIBs to provide substantial personal protection even against pyrethroid resistant mosquitoes. LLIBs may prove particularly useful where LLINs are unsuitable or net usage is low

    CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

    Full text link
    Artificial Intelligence (AI), along with the recent progress in biomedical language understanding, is gradually changing medical practice. With the development of biomedical language understanding benchmarks, AI applications are widely used in the medical field. However, most benchmarks are limited to English, which makes it challenging to replicate many of the successes in English for other languages. To facilitate research in this direction, we collect real-world biomedical data and present the first Chinese Biomedical Language Understanding Evaluation (CBLUE) benchmark: a collection of natural language understanding tasks including named entity recognition, information extraction, clinical diagnosis normalization, single-sentence/sentence-pair classification, and an associated online platform for model evaluation, comparison, and analysis. To establish evaluation on these tasks, we report empirical results with the current 11 pre-trained Chinese models, and experimental results show that state-of-the-art neural models perform by far worse than the human ceiling. Our benchmark is released at \url{https://tianchi.aliyun.com/dataset/dataDetail?dataId=95414&lang=en-us}

    Perceived dental treatment need among older Tanzanian adults – a cross-sectional study

    Get PDF
    Need perceptions for dental care play a key role as to whether people in general will seek dental care. The aim was to assess the prevalence of perceived need of problem based dental care, dental check-ups and any type of dental care. Guided by the conceptual model of Wilson and Cleary, the relationship of perceived need for dental care with socio-demographic characteristics, clinically defined dental problems and self-reported oral health outcomes was investigated. Partial prosthetic treatment need was estimated using a socio-dental approach. A cross-sectional survey was conducted in Pwani region and in Dar es Salaam in 2004/2005. Information from interviews and clinical examination became available for 511 urban and 520 rural adults (mean age 62.9 yr). 51.7% (95% CI 46.2, 57.0) urban and 62.5 % (95% CI 53.1, 70.9) rural inhabitants confirmed need for dental check-up, 42.9% (95% CI 36.9, 48.9) urban and 52.7% (95% CI 44.5, 60.6) rural subjects confirmed need for problem oriented care and 38.4% (95% CI 32.4, 44.6) urban versus 49.6% (95% CI 41.8, 57.4) rural residents reported need for any type of dental care. Binary and ordinal multiple logistic regression analyses revealed that adults who reported bad oral health and broken teeth were more likely to perceive need for dental care across the three outcome measures than their counterparts. Socio-demographic factors and clinically defined problems had less impact. Based on a normative and an integrated socio-dental approach respectively 39.5% and 4.7% were in need for partial dentures. About half of the participants confirmed need for problem oriented care, dental check-ups and any type of dental care. Need perceptions were influenced by perceived oral health, clinically assessed oral problems and socio-demographic characteristics. Need estimates for partial denture was higher when based on clinical examination alone compared to an integrative socio-dental approach
    • …
    corecore